Word | Frequency | Number of right neighbors | Number of left neighbors | Ratio (right/left) |
---|---|---|---|---|
n | 1481 | 82 | 2 | 41.0000 |
Door | 4400 | 319 | 8 | 39.8750 |
Dus | 810 | 32 | 1 | 32.0000 |
Zij | 396 | 31 | 1 | 31.0000 |
De | 12642 | 816 | 28 | 29.1429 |
Wel | 340 | 23 | 1 | 23.0000 |
Verder | 367 | 22 | 1 | 22.0000 |
Toch | 276 | 21 | 1 | 21.0000 |
Hier | 257 | 21 | 1 | 21.0000 |
Daar | 648 | 38 | 2 | 19.0000 |
Zijn | 424 | 18 | 1 | 18.0000 |
Of | 691 | 36 | 2 | 18.0000 |
Ook | 1813 | 53 | 3 | 17.6667 |
Zo | 912 | 52 | 3 | 17.3333 |
Dan | 799 | 50 | 3 | 16.6667 |
Maar | 2645 | 82 | 5 | 16.4000 |
En | 3191 | 97 | 6 | 16.1667 |
Overigens | 233 | 16 | 1 | 16.0000 |
Toen | 359 | 16 | 1 | 16.0000 |
Daarbij | 186 | 15 | 1 | 15.0000 |

Words with the lowest ratios:

Word | Frequency | Number of right neighbors | Number of left neighbors | Ratio (right/left) |
---|---|---|---|---|
kun | 562 | 2 | 25 | 0.0800 |
dec | 62 | 1 | 11 | 0.0909 |
vorm | 161 | 1 | 11 | 0.0909 |
buitenlandse | 116 | 1 | 10 | 0.1000 |
album | 105 | 1 | 9 | 0.1111 |
kamer | 80 | 1 | 8 | 0.1250 |
eenvoudig | 70 | 1 | 8 | 0.1250 |
juli | 643 | 5 | 38 | 0.1316 |
miljard | 336 | 5 | 36 | 0.1389 |
bekende | 118 | 1 | 7 | 0.1429 |
meneer | 114 | 1 | 7 | 0.1429 |
vorig | 389 | 3 | 20 | 0.1500 |
volgend | 169 | 2 | 13 | 0.1538 |
z | 320 | 2 | 13 | 0.1538 |
augustus | 460 | 6 | 37 | 0.1622 |
december | 569 | 6 | 37 | 0.1622 |
arm | 67 | 1 | 6 | 0.1667 |
I | 77 | 1 | 6 | 0.1667 |
homo | 78 | 1 | 6 | 0.1667 |
ton | 63 | 1 | 6 | 0.1667 |
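In this subsection, we compute, for each word, the ratio of its number of right neighbors to its number of left neighbors. Again, we look for words with extreme ratios: the first table lists the 20 words with the highest ratios, and the second table lists the 20 words with the lowest ratios.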
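The ratio itself can be illustrated with a minimal Python sketch. It assumes that neighbor co-occurrences are available as (left word, right word) pairs and that each distinct neighbor is counted once; the function name `neighbor_ratios` and the toy pairs are hypothetical and not taken from the corpus.

```python
from collections import defaultdict

def neighbor_ratios(pairs):
    """Map each word to (n_right, n_left, n_right / n_left).

    pairs: iterable of (left_word, right_word) neighbor co-occurrences.
    Only words with at least one right and one left neighbor are kept,
    so the division is always defined.
    """
    right = defaultdict(set)  # word -> distinct words directly after it
    left = defaultdict(set)   # word -> distinct words directly before it
    for w1, w2 in pairs:
        right[w1].add(w2)
        left[w2].add(w1)
    result = {}
    for word in right.keys() & left.keys():
        n_right, n_left = len(right[word]), len(left[word])
        result[word] = (n_right, n_left, n_right / n_left)
    return result

# Toy data, invented for illustration only.
pairs = [("De", "man"), ("De", "vrouw"), ("De", "hond"),
         ("ziet", "De"), ("man", "ziet"), ("vrouw", "ziet")]
ratios = neighbor_ratios(pairs)
for word, (n_right, n_left, ratio) in sorted(
        ratios.items(), key=lambda item: -item[1][2]):
    print(word, n_right, n_left, round(ratio, 4))
```

In this toy input, "De" has three distinct right neighbors and one left neighbor, so it gets the highest ratio, while "hond" is dropped because it never occurs as a left-hand word.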
Data for first table:
select w.word, w.freq, aa.cnt, bb.cnt, aa.cnt/bb.cnt as r from words w, (select w1_id, count(c.w2_id) as cnt from co_n c where w1_id>100 group by w1_id) aa, (select w2_id, count(c.w1_id) as cnt from co_n c where w2_id>100 group by w2_id) bb where w.w_id=aa.w1_id and aa.w1_id=bb.w2_id order by r desc limit 20;
Diagram data:
select aa.cnt, bb.cnt from (select w1_id,count(c.w2_id) as cnt from co_n c where w1_id>100 group by w1_id) aa, (select w2_id,count(c.w1_id) as cnt from co_n c where w2_id>100 group by w2_id) bb where aa.w1_id=bb.w2_id;
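The table query can be tried out on a small in-memory database. The following is a sketch under stated assumptions: the reduced schema (`words` with `w_id`, `word`, `freq`; `co_n` with one row per neighbor pair) is inferred from the column names in the queries, the rows are invented, and SQLite is used only for convenience. Because SQLite truncates integer division, the sketch multiplies by `1.0`. Sorting in ascending order presumably yields the data for the second table; the source only gives the descending query.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
# Assumed, simplified schema; the real tables may have more columns.
conn.executescript("""
    CREATE TABLE words (w_id INTEGER PRIMARY KEY, word TEXT, freq INTEGER);
    CREATE TABLE co_n (w1_id INTEGER, w2_id INTEGER);
""")
# Toy rows (invented). Ids start above 100 to pass the id filter.
conn.executemany(
    "INSERT INTO words VALUES (?, ?, ?)",
    [(101, "De", 3), (102, "man", 1), (103, "vrouw", 1),
     (104, "hond", 1), (105, "ziet", 2)])
conn.executemany(
    "INSERT INTO co_n VALUES (?, ?)",
    [(101, 102), (101, 103), (101, 104),
     (105, 101), (102, 105), (103, 105)])

QUERY = """
    SELECT w.word, w.freq, aa.cnt, bb.cnt, 1.0 * aa.cnt / bb.cnt AS r
    FROM words w,
         (SELECT w1_id, COUNT(w2_id) AS cnt FROM co_n
          WHERE w1_id > 100 GROUP BY w1_id) aa,
         (SELECT w2_id, COUNT(w1_id) AS cnt FROM co_n
          WHERE w2_id > 100 GROUP BY w2_id) bb
    WHERE w.w_id = aa.w1_id AND aa.w1_id = bb.w2_id
    ORDER BY r {direction}
    LIMIT 20
"""
# Highest ratios first (as in the first table), then lowest first.
top = conn.execute(QUERY.format(direction="DESC")).fetchall()
bottom = conn.execute(QUERY.format(direction="ASC")).fetchall()
print(top[0])
print(bottom[0])
```

Words that never appear on the left or never appear on the right of a pair have no row in one of the two subqueries, so the join drops them and a division by zero cannot occur.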